Monstrous Moonshine: The first twenty- five years 



Terry Gannon 
Abstract 

Twenty-five years ago, Conway and Norton published in this journal^ their remarkable paper 'Monstrous Moon- 
shine', proposing a completely unexpected relationship between finite simple groups and modular functions. This 
paper reviews the progress made in broadening and understanding that relationship. 

1. Introduction 

It has been approximately twenty-five years since John McKay remarked that 

196 884 = 196 883 + 1. (1.1) 

That time has seen the discovery of important structures, the establishment of another 
deep connection between number theory and algebra, and a reinforcement of a new era 
of cooperation between pure mathematics and mathematical physics. It is a beautiful 
and accessible example of how mathematics can be driven by strictly conceptual concerns, 
and of how the particular and the general can feed off each other. Now, six years after 
Borcherds' Fields Medal, the original flurry of activity is over; the new period should be 
one of consolidation and generalisation and should witness the gradual movement of this 
still rather esoteric corner of mathematics toward the mainstream. 

The central question McKay's equation (1.1) raises, is: What does the j-function (the 
left side) have to do with the Monster finite group (the right side)? Many would argue that 
we still don't have our finger on the essence of the matter. But what is clear is that we 
understand far more about this central question today than we did in 1978. Today we say 
that there is a vertex operator algebra, called the Moonshine module V\ which interpolates 
between the left and right sides of (1.1): its automorphism group is the Monster and its 
graded dimension is the j-function (—744). 

This paper tries to summarise this work of the past twenty-five years in about as many 
pages. The original article [24] is still very readable and contains a wealth of information 
not found in other sources. Other reviews are [21], [87], [12], [39], [90], [48], [15], [78], 
[102], [16], [46] and the introductory chapter in [44], and each has its own emphasis. Our 
own bias here has been to breadth at the expense of depth, which probably limits this 
review to be a mere annotated sampling of representative literature. 



2. Background 

In §2.1 we describe the finite simple groups and in particular the Monster. In §2.2 we 
focus on the modular groups and functions which arise in Monstrous Moonshine. 

2.1. The Monster. By definition, a simple group is one whose only normal subgroups 
are the trivial ones: {1} and the group itself. The importance of the finite simple groups 
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lies in their role as building blocks, in the sense that any finite group G can be constructed 
from {1} by extending successively by a (unique up to order) sequence of finite simple 
groups. For example the symmetric group ^'4 arises in this way from the cyclic groups 
C2, C2, C3, C2. 

A formidable accomplishment last century was determining the explicit list of all finite 
simple groups. See e.g. [49] for more details and references. These groups are: 

(i) the cyclic groups Cp, p prime; 

(ii) the alternating groups A^, n > 5; 

(iii) 16 infinite families of groups of Lie type; 

(iv) 26 sporadic groups. 

An example of a family of finite simple group of Lie type is PSL„(Fg), i.e. the group 
of n X n matrices with determinant 1, and entries from the finite field Fg, quotiented out 
by its centre (the scalar matrices al, where a'^ — 1). 

The mysteriousness of the sporadics is due to their falling outside those infinite fam- 
ilies. They range in size from the Mathieu group Mn, with order 7920 and discovered in 
1861, to the Monster M, with order 

|M| = 2^^ • 3^° • 5^ • 7^ • 11^ • 13^ • 17 • 19 • 23 • 29 • 31 • 41 • 47 • 59 • 71 f» 8 X 10^^ . (2.1) 

The existence of M was conjectured in 1973 by Fischer and Griess, and finally constructed 
in 1980 by Griess [50]. Most sporadics arise in M (e.g. as quotients of subgroups). We'll 
encounter many of these sporadics in the coming pages, but most of our attention will be 
directed at M. 

Griess showed in fact that M was the automorphism group of a 196883-dimensional 
commutative nonassociative algebra, now called the Griess algebra, but the construction 
was somewhat artificial. We now understand [44] the Griess algebra as the first nontrivial 
tier of an infinite-dimensional graded algebra, the Moonshine module V\ which lies at 
the heart of Monstrous Moonshine. We'll discuss V'' in §4.2; we will find that it has a 
very rich algebraic structure, is conjectured to obey a strong uniqueness property, and has 
automorphism group M. 

The Monster M has a remarkably simple presentation. As with any noncyclic finite 
simple group, it is generated by its involutions (i.e. elements of order 2) and so will be a 
homomorphic image of a Coxeter group. Let Qpqr, p > q > r > 2,he the graph consisting of 
three strands of lengths p + l,q + l,r + l, sharing a common endpoint . Label the p+q+r + 1 
nodes as in Figure 1. Given any graph Qpqr, define Ypgr to be the group consisting of a 
generator for each node, obeying the usual Coxeter group relations (i.e. all generators are 
involutions, and the product gg' of two generators has order 3 or 2, depending on whether 
or not the two nodes are adjacent), together with one more relation: 

{abib2aciC2adid2y° = 1 . (2.2) 

The groups Ypg^, for p < 5, have now all been identified (see e.g. [61]). Conway conjec- 
tured and, building on work by Ivanov [60], Norton proved [98] that Y555 = I444 is the 
'Bimonster', the wreathed-square M1C2 of the Monster (so has order 2|Mp). A closely 
related presentation of the Bimonster has 26 involutions as generators and has relations 
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given by the incidence graph of the projective plane of order 3; the Monster itself arises 
from 21 involutions and the afhne plane of order 3 [25]. Likewise, Y553 = I443 = M x C2. 
Other sporadics arise in e.g. Y533 = I433 (the Baby Monster B), Y552 = I442 (the Fischer 
group Fi24), and ^532 = I432 (the Fischer group Fi23)- The Coxeter groups of ^555, ^553, 
^533) ^552, and ^532, are all infinite groups of hyperbolic reflections in e.g. R-*^^'-*^, and con- 
tain copies of groups such as the aflfine £^8 Weyl group, so the geometry here should be 
quite pretty. What role, if any, these remarkable presentations have in Moonshine hasn't 
been established yet. As a first step though, [97] identifies in Aut{V'^) the 21 involutions 
generating M. 



The Monster has 194 conjugacy classes, and so that number of irreducible represen- 
tations. Its character table (and much other useful information) is given in the Atlas [22], 
where we also find analogous data for the other simple groups of 'small' order. For ex- 
ample, we find that M has exactly 2, 3, 4 conjugacy classes of elements of order 2, 3, 4, 
respectively — these classes are named 2A, 2B, 3A, etc. We also find that the dimensions 
of the smaUest irreducible representations of M are 1, 196883, 21296876, and 842609326. 
This is the same 196883 as on the right side of (1.1), and as the dimension of the Griess 
algebra. 

2.2. The j -function. The group SL2(M), consisting of 2 x 2 matrices of determinant 
1 with real entries, acts on the upper half-plane H := {r e C | Im(T) > 0} by fractional 
linear transformations 



Of course this is really an action of PSL2(M) := SL2(]R)/{±/} on H, but it is more conve- 
nient to work with SL2(M). H is the hyperbolic plane, one of the three possible geometries 
in two dimensions (the others arc the sphere and the Euclidean plane), and PSL2(M) is its 
group of orientation-preserving isometrics. 

Let G be a discrete subgroup of SL2(M). Then the space G\1HI has a natural structure 
of an orientable surface, and inherits a complex structure from H (so can be regarded as 
a complex curve). By the genus of the group G, we mean the genus of the resulting real 
surface G\E[. For example, the choice G = SL2(Z) yields the sphere with one puncture, so 
SL2(Z) has genus 0. Moreover, any curve S with genus g and n punctures, for 3g-\-n > 3, is 
equivalent as a complex curve to the space G\HI, for some subgroup G of SL2 (R) isomorphic 
to the fundamental group 7ri(S). 




Figure 1. The graph ^555 presenting the Bimonster 




(2.3) 
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The most important choice for G is SL2(Z), thanks to its interpretation as the modular 
group of the torus. Most groups G of interest are commensurable with SL2(Z), i.e. G fl 
SL2(Z) has finite index in both G and SL2(Z). Examples of these are the congruence 
subgroups 

d)^SM2)|(^ (modiV)}, (2.4a) 

Vq{N):={{^^ e SL2(Z) I N divides c} . (2.46) 

For example ro(A^) has genus for A'' = 2, 13, 25, while = 50 has genus 2 and = 24 
has genus 3. The following definition includes all groups arising in Monstrous Moonshine. 

Definition 1. Call a discrete subgroup G of S'L2(M) a moonshine-type modular 
group, if it contains some Vq{N), and also obeys the condition that 

1 t 



Q ^ . e G iff t e Z 

Such a modular group is necessarily commensurable with SL2(Z). Note that for such 
a G, any meromorphic function / : G\]HI — > C will have a Fourier expansion of the form 
f{r) = E"=-oo «n9", where q = e^--. 

Definition 2. Let G be any subgroup of SL2{M.) commensurable with 5^2 (Z). By a 
modular function / for G we mean any meromorphic function / : H — > C, such that 

and such that, for any A e SL2(Z), the function f{A.T) has Fourier expansion of the form 
Yl'^=-oo ^nO^^^ for some N and bn (both depending on A), and where 6„ = for all but 
finitely many negative n. 

This definition simply states that / is a meromorphic function on the compact surface 
Eg := G\M, where H := H U Q U {ioo}. The G-orbits of Q U {ioo} are called cusps; 
their role is to fill in the punctures of G\EI, compactifying the surface, as there are much 
fewer meromorphic functions on compact surfaces than on noncompact ones (compare the 
Riemann sphere to the complex plane!). 

We are especially interested in genus groups G of moonshine-type. Their modular 
functions are particularly easy to characterise: there will be a unique modular function Jq 
for G, with g-expansion of the form 

oo 

JG(T) = ?-' + X;«n9" ; (2.6) 

n=l 

the modular functions for G are precisely the rational functions /(r) = po|y(j^(^)) in Jg- 
This function Jc is called the (normalised) Hauptm,odul for the genus group G. For 
example, the modular group SL2(Z) has Hauptmodul 

^SL2(Z) (r) = J(t) = q-^ + 196884 q + 214 93760 q^ + 8642 909970 q^ + ■ ■ ■ . (2.7) 
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This 196884 is the same as that on the left-side of (1.1). Historically, in place of this 
Hauptmodul was the equivalent 

As we know, there are other genus modular groups. For example the Hauptmoduls 
for ro(2), ro(13), and ro(25), are respectively 

J2(t) = + 276? - 2048g2 + 11202g3 - 49152^^ + 184024?^ + • • • , (2.8a) 
Ji3(r) =q-^-q + 2q^ + + 2q^ - 2q^ - 2q^ - 2q^ + q^ + ■ ■ ■ , (2.86) 
J25(t) =q-'-q + q'' + q^- q^^ - q^^ + q^^ + ^24 _ ^26 ^ . . . _ (2.8c) 

Thompson [115] proved there are only finitely many modular groups of moonshine- 
type in each genus. Cummins [28] has found all of these of genus and 1. In particular 
there are precisely 6486 genus moonshine-type groups. Exactly 616 of these have Haupt- 
moduls with rational (in fact integral) coefficients, the remainder have cyclotomic integer 
coefficients. There are some natural equivalences (e.g. a Galois action) which collapse this 
number to 371, 310 of which have integral Hauptmoduls. 

In genus > 0, two functions are needed to generate the function field. A complication 
facing the development of a higher-genus Moonshine is that, unlike the situation in genus 
considered here, there is no canonical choice for these generators. 

See e.g. [92] for a very readable account of some of the circle of ideas meandering 
through this subsection. Modular functions are discussed in e.g. [75] . 

3. The Monstrous Moonshine conjectures 

The number on the left of (1.1) is the first nontrivial coefficient of the j-function, and 
the numbers on the right are the dimensions of the smallest irreducible representations of 
the Fischer-Griess Monster M. On the one side we have a modular function; on the other, 
a sporadic finite simple group. Moonshine is the explanation and generalisation of this 
unlikely connection. 

But first, why can't (1.1) merely be a coincidence? This is soon dispelled by comparing 
the next few coefficients of J with the dimensions of irreducible representations of M: 

214 93760 = 212 96876 + 196883 + 1 , (3.1a) 
8642 99970 = 8426 09326 + 212 96876 + 2 • 196883 + 2 • 1 . (3.16) 

3.1. The fundamental conjecture of Conway and Norton. The central structure in the 
attempt to understand equations (1.1) and (3.1) is an infinite-dimensional graded module 
for the Monster: 

V = Vo®Vi®V2®Vz®-- - . (3.2a) 

If we let pd denote the d-dimensional irreducible representation of M, then the first few 
subspaces will be Vq = pi, Vi = {0}, V2 = pi © pigesss, and V3 = pi © pigesss © P21296876. 
This module is to have graded dimension 

dimv (r) = ^ q''dim{Vn) = 1 + 196884?^ + 214 93760?^ + . . . = g J(r) . (3.26) 

n=0 
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Of course, (3.2b) alone certainly doesn't uniquely determine V, but assume for now this 
V has been found. Thompson [114] suggested studying in addition the graded traces 

oo 

Tg{T):=q-'J2^hvMq'' (3.2c) 

n=0 

for all g G M, where the chy^ are characters. As taking g = 1 recovers J, (3.2c) is a natural 
twist of (3.2b). The functions Tg are now called the McKay-Thompson series. 

Conway and Norton conjectured [24] that for each element g of the Monster M, Tg is 
the Hauptmodul 

oo 

JG,{r)^q-' + Y,an{g)q'' (3.3) 

n=l 

for a genus subgroup Gg of SL2(M). So for each n the coefficient g i— an{g) defines a 
character chy^ig) of M. They explicitly identify each of the groups Gg; these groups each 
contain To{N) as a normal subgroup, for some N dividing o(5r) gcd(24, o(5r)) {o{g) is the 
order of g), and the quotient group Gg/To(N) has exponent 2 (or 1). 

Since Tg = T^^^-i by definition, there are at most 194 distinct McKay-Thompson 
series. All coefficients an{g) are integers (as are in fact most entries of the character table 
of M). This implies that Tg = T^, whenever the cyclic subgroups (g) and {h) are equal. 
In fact, the total number of distinct McKay-Thompson series Tg arising in Monstrous 
Moonshine turns out to be only 171. The first 50 coefficients a^ig) of each Tg are given 
in [91]. Together with the recursions given in §3.3 below, this allows one to effectively 
compute arbitrarily many coefficients an{g) of the Hauptmoduls. It is also this which 
uniquely defines V, up to equivalence, as a graded M-module. 

For example, there are two different conjugacy classes of order 2 elements. One of 
these gives the Hauptmodul J2 in (2.8a), while the other corresponds to (3.4) below. 
Similarly, (2.8b) corresponds to an order 13 clement, but J25 in (2.8c) doesn't equal any Tg. 
Recall that there are exactly 616 Hauptmoduls of moonshine-type with integer coefficients, 
so most of these don't arise as Tg. Recently [23], a fairly simple characterisation has 
been found of the groups arising as Gg in Monstrous Moonshine. Their proof of this 
characterisation is by exhaustion. 

Conway coined this conjecture Monstrous Moonshine. The word 'moonshine' here is 
English slang for 'insubstantial or unreal', 'idle talk or speculation', 'an illusive shadow'. It 
is meant to give the impression that matters here are dimly lit, and that [24] is 'distilling 
information illegally' from the character table of M. 

Monstrous Moonshine began, unofficially, in 1975 when Andrew Ogg remarked that 
the list of primes p for which the group 

ro(pH:=(r„(p).-^(° -;)) (3.4) 

has genus 0, is precisely equal to the list of primes p dividing the order of M. Indeed, in 
the tables of [24] we find that, for each prime p dividing |M|, an element ^ of M of order 
p is assigned the group Gg = ro(p)-|-. 
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3.2. Lie theory and Moonshine. McKay not only noticed (1.1), but also observed that 



j(r)3 = g-3 (1 + 248g + 4124g2 + 34 752g3 + •••)• (3.5) 

The point is that 248 is the dimension of the defining representation of the Eg Lie group, 
while 4124 = 3875 + 248 + 1 and 34 752 = 30 380 + 3875 + 2-248 + 1. Incidentally, 
is a generating modular function for the genus-0 group r(3). Thus Moonshine is related 
somehow to Lie theory. 

McKay later found independent relationships with Lie theory [89], [15], [47], reminis- 
cent of his famous A-D-E correspondence with finite subgroups of SU2(C). As mentioned 
earlier, M has two conjugacy classes of involutions. Let K be the smaller one, called '2A' 
in [22] (the alternative, class '2B', has almost 100 million times more elements). The 
product of any two elements of K will lie in one of nine conjugacy classes: namely, lA, 
2A, 2B, 3A, 3C, 4A, 4B, 5A, 6A, corresponding respectively to elements of orders 1, 2, 2, 
3, 3, 4, 4, 5, 6. It is surprising that, for such a complicated group as M, that list stops 
at only 6 — we call M a 6-transposition group for this reason (more on this in §5.2). The 
punchline: McKay noticed that those nine numbers are precisely the labels of the affine 
Es diagram (see Figure 2). Thus we can attach a conjugacy class of M to each vertex 
of the E^ diagram. An interpretation of the edges in the E^ diagram, in terms of M, is 
unfortunately not known. 



1 2 

• ^ 



4 2 

X:» • 



Figure 2. The affine E'g, -F4, and G2 diagrams with labels 

We can't get the affine Ej labels in a similar way, but McKay noticed that an order 
two folding of affine E'j gives the affine -F4 diagram, and we can obtain its labels using the 
Baby Monster B (the second largest sporadic). In particular, let K now be the smallest 
conjugacy class of involutions in B (also labelled '2 A' in [22]); the conjugacy classes in 
KK have orders 1, 2, 2, 3, 4 (B is a 4-transposition group), and these are the labels of F4. 
Of course we'd prefer E^ to F4, but perhaps that two-io\dm.g has something to do with 
the fact that an order- iw;o central extension of B is the centraliser of an element e M of 
order two. 

Now, the tnp/e- folding of affine E^ is affine G2- The Monster has three conjugacy 
classes of order three. The smallest of these ('3A') has a centraliser which is a triple cover 
of the Fischer group ^^24. 2. Taking the smallest conjugacy class of involutions in Fi'24^.2, 
and multiplying it by itself, gives conjugacy classes with orders 1, 2, 3 (hence -F'^24.2 is a 
3-transposition group) — and those not surprisingly are the labels of G2! 
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Although we now understand (3.5) (see §4.1) and have proven the fundamental Conway- 
Norton conjecture (see §§4.2-4.4), McKay's Eg., F4, G2 observations still have no explana- 
tion. In [47] these patterns are extended, by relating various simple groups to the 
diagram with deleted nodes. 

3.3. Replicable functions. There are several other less important conjectures. One 
which played an important role in ultimately proving the main conjecture involves the 
replication formulae. Conway-Norton want to think of the Hauptmoduls Tg as being 
intimately connected with M; if so, then the group structure of M should somehow directly 
relate different Tg. Considering the power map g ^ g'^ leads to the following. 

It was well-known classically that j{T) has the property that jipr) + 

■ ■ ■ + j{ '^~^p~^ ) is a polynomial in j(t), for any prime p {proof: it's a modular function for 
SL2(Z), and hence equals a rational function of j (t); since its only poles will be at the 
cusps, the denominator polynomial must be trivial). Hence the same will hold for J. More 
generally, we get 

ad=n,0<b<d 

where Qn is the unique polynomial for which Qn{J{j)) — Q~"' has a g-expansion with only 
strictly positive powers of q. For example, Q2{x) = — 2ai and Q3{x) = — 3aix — 3a2, 
where we write J(t) = On?"'. The left side of (3.6a) is really a Hecke operator applied 
to J. These equations (3.6a) can be rewritten into recursions such as — a3 + (a^ — ai)/2, 
or collected together into the remarkable expression (originally due to Zagier) 

P~^ Yli^- p"'g")'''"" = J{z) - J{t) , (3.66) 

where p = e^'^'^. 

Conway and Norton conjectured [24] that these formulas have an analogue for any 
McKay-Thompson series Tg. In particular, (3.6a) becomes 

E Tgai^:^) = Q^,g{Tg{r)) , (3.7a) 

ad=n,0<b<d 

where Qn,g plays the same role for Tg that Qn played for J. These are called the replication 
formulae. Again, these yield recursions like a^ig) = a2{g) + {ai{g)'^ — ai{g^))/2, or can be 
collected into the expression 

p-' exp[- E E «-n(/)^^] = Tgiz) - Tg{T) . (3.76) 

fe>0 "»>o 

This looks a lot more complicated than (3.6b), but you can glimpse the Taylor expansion 
of ln(l —p^q^) there and in fact for = 1 (3.7b) reduces to (3.6b). 
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Axiomatising (3.7a) leads to Norton's notion of replicable function [96], [1]. Write 
/(i)(t) = + E^i^i^^?''. and replacing each Tga in (3.7a) with use (3.7a) to 

recursively define each Z^"^-*. If each f^'^^ has a g-expansion of the form f^^^r) = + 
X^fcLi i>k^ q'^ — i.e. no fractional powers of q arise — then we call / = f^^^ replicable. 
Equation (3.7a) says the McKay-Thompson series are replicable, and [30] proved that 
the Hauptmodul of any genus modular group of moonshine-type is replicable, provided 
its coefficients are rational. Conversely, Norton conjectured that any replicable function 
with rational coefficients is either such a Hauptmodul, or one of the 'modular fictions' 
/(r) = /(t) = q~^ ± q. This conjecture seems difficult and is still open. Incidentally, 

if the coefficients h')^^ are irrational, then the definition (3.7a) of replicability should be 
modified to include Galois automorphisms (see §8 of [29]). Replication in positive genus 
is discussed in [109]. 

Replication (3.7a) concerns the power map g ^ g'^ hv M. Can Moonshine see more 
of the group structure of M? We explored one step in this direction in §3.2, where McKay 
modeled products of conjugacy classes using Dynkin diagrams. A different idea is given 
in §5.1. It would be very desirable to find other direct connections between the group 
operation in M and e.g. the McKay-Thompson series. 

3.4- The Leech lattice and Moonshine. The Leech lattice A = A24 is a 24-dimensional 
even self-dual lattice [26] which is to lattices much as the M-module V of (3.2a) turns 
out to be for vertex operator algebras (see §4.2 below). A has no vectors of odd norm, no 
norm-2 vectors, and precisely 196560 norm-4 vectors — a number remarkably close to the 
monstrous 196883. In fact its theta series 0a(^) = X^^ga^" "^^' when divided by r7(T)^^, 
equals J(r) + 24. Is this another example of Moonshine? 

Indeed it is. However we have: 

Theorem 3. Let L C he any n- dimensional positive-definite lattice whose norms 
V • V are all rational. Let t e be any vector with finite order in L: i.e. mt e L for some 
nonzero m e Z. Then the theta series 

divided by r]{T)'^ , is a modular function for some T{N). 

See e.g. Theorem 20 of [100] for a proof of this classical result. If L is in fact an even 
lattice (i.e. all norms v- v lie in 2Z), we can say more. Let L* :— {x G | x-L C Z} be the 
dual lattice. It contains L with finite index; write tj + L, z = 1, . . . , M, for the finitely many 
cosets in L*/L. Define a column vector XLi^) with zth component 6t^+L(T)/?7(T)". Then 

XL forms a vector-valued modular function for SL2(Z): for any A = ^) ^ SL2(Z), 

xd^^)-PiA)xL{r) (3.8) 

for some M-dimensional unitary matrix representation p of SL2(Z). In particular, for the 
Leech lattice L = A, M = 1 and we can quickly identify 0a(t) in terms of J(t). Although 
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the 196560 196884 coincidence is thus trivial to explain, it will turn out to be a very- 
instructive example of Moonshine. 

The lattices are related to groups through their automorphism groups, which will al- 
ways be finite for positive-definite lattices. The automorphism group Coq :— Aut(A) of 
the Leech lattice has order about 8 x 10^^, and is the direct product of C2 with Con- 
way's sporadic group Coi. Several other sporadics are also involved in Aut(A). To each 
automorphism a G Aut(A), let 9 a denote the theta series of the sublattice of A fixed 
by a. [24] also associate to each automorphism a a certain function tjair) of the form 
YliVi^'i^) / Ylj vibj'!') built out of the Dedekind eta. Both 6*0, and rja are constant on each 
conjugacy class in Aut(A), of which there are 202. [24] remarks that the ratio 9a/r]a 
always seems to equal some McKay-Thompson series Tg(^a)- See also [87]. 

It turns out that this observation isn't quite correct [74]. For each a G Aut(A), the 
subgroup of SL2(]R) which fixes 6 a Ma is indeed genus 0, but for exactly 15 conjugacy 
classes in Aut(A), dajr^a is not the Hauptmodul. 

Similarly, one can ask this for the lattice, whose automorphism group is the Weyl 
group (of order 7 x 10^) of the iJg Lie group. The automorphisms a of the £^8 lattice 
which yield a Hauptmodul were classified in [19]. 

Jf.. Proof of the Monstrous Moonshine conjectures 

At first glance, the significance of the Moonshine conjectures seems very unlikely: they 
constitute after all a finite set of very specialised coincidences. The whole point though 
is to try to understand why such seemingly incomparable objects as the Monster and the 
Hauptmoduls can be so related, and to try to extend and apply this understanding to other 
contexts. Establishing the truth (or falsity) of the conjectures was merely meant as an aid 
to uncovering the why of Moonshine. Indeed, in achieving this understanding, important 
new algebraic structures were formulated. We will sketch this theory below. 

The main Conway-Norton conjecture was attacked almost immediately. Thompson 
showed [113] (see also [103]) that if (7 hh^ an{g) is a character for all sufficiently small 
n (apparently n < 1300 is sufficient), then it will be for all n. He also showed that if 
certain congruence conditions hold for a certain number of a^^g) (all with n < 100), then 
all g 1-^ an{g) will be virtual characters (i.e. differences of true characters of M). Atkin, 
Fong, and Smith (see [110] for details) used that to prove on a computer that indeed all 
an{g) were virtual characters (they didn't quite reach n = 1300 though). But their work 
doesn't say anything more about the underlying (possibly virtual) representation V, other 
than its existence. Their work plays no role in the following. 

We want to show that the McKay-Thompson series T'g(r) of (3.2c) equals the Haupt- 
modul JogiT) in (3.3). First, we need to construct the infinite-dimensional module V of M. 
We discuss this, and the underlying theory of vertex operator algebras, in §4.2. Borcherds' 
strategy [11] was to bring in Lie theory, by associating to the module V a 'Monster Lie 
algebra'. This algebra, and the underlying theory of generalised Kac-Moody algebras, 
is described in §4.3. In the final subsection we go from the Monster Lie algebra to the 
replication formulae, and conclude the proof. We begin this section though by explaining 
the much simpler connection of Eg with j 3 . 

4-1- Eg and j^. An explanation for the relation between £^8 and was found 
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almost immediately, by Kac and Lepowsky [65], [76]: is the (normalised) character of 
a representation of the affine Kac-Moody algebra ^'g^''. Given a finite-dimensional simple 
Lie algebra g, the affine algebra q^^^ is the infinite-dimensional Lie algebra consisting of all 
Laurent polynomials ant"' where a„ G g and t is an indeterminate, together with 

a central term and derivation D = —Lq (see [66], [70]). Highest weight representations 
are defined in the usual way. Thanks largely to the fact that the affine Weyl group is a 
semi-direct product of the additive group (r = rank(g)) with the finite Weyl group, the 
characters of these representations (especially the 'integrable' highest weight ones, which 
are the direct analogue of the finite-dimensional representations of q) transform nicely 
with respect to SL2(Z). See e.g. Chapter 13 of [70] for details. This is probably the single 
biggest reason Kac-Moody algebras are so well-known. 

Theorem 4. [69] Let g be any finite- dimensional Lie algebra, and q^^^ denote the 
corresponding affine algebra. Let denote the finitely many 'level k integrable highest 
weight modules' L\ of The Q^^^ -module L\ has a natural Z-grading L\ = ©^o(-^A)n 
into finite- dimensional g-modules {Lx)n- Let xxij) = 1^^~'^^'^^Yll^=o^^^{{L\)n) q" be 
the corresponding normalised character (for some appropriate choice of h\ — c/24 G QJ. 
Then each x\ is a holomorphic function in H, and the vector Xk{'T) with entry X\{'t) for 
each A G defines a vector-valued modular function for SL2{'L), as in (3.8), for some 
finite- dimensional unitary representation p of SL2{'L). 

In fact each character x\ will be a rational function in lattice theta series, and so will 
be a modular function for some r(A'"). It turns out that there is only one level 1 integrable 
highest-weight representation of and its character equals . The modularity of 
is thus predicted by Kac-Moody theory, and the fact that the coefficients are dimensions 
of -Eg representations is automatic. We will see a simultaneous generalisation of Theorems 
3 and 4 next section. 

We've already encountered the mysterious normalisation of the McKay-Thompson 
series, and the of the e'^^ character, and more generally the q^^-^/"^^ of x\- Many 
explanations have been provided for this pervasive factor. For example, to [2] it is topolog- 
ical in origin, and related to the Atiyah-Singer Index Theorem; a geometric interpretation 
using determinant line bundles is due to Segal [107]. In quantum physics it's called the 
conformal anomaly (a breakdown of manifest conformal symmetry when the classical sys- 
tem is quantised), and is introduced in regularisation as a vacuum energy. Probably the 
simplest instance of it is the prefactor in the familiar definition ri{T) — Y['^=ii^~Q^) f*^^ 
the Dedekind eta: reading through classical proofs for its modularity we find that '^' here 
arises through the combination C(2)/(27r)^; rj appears also in physics in the partition func- 
tion of the bosonic string, and that same '^' arises there via regularisation as — C(— 1)/2. 
The equivalence of these two expressions for comes from the functional equation of the 
Riemann zeta. This same zeta value appears famously in the central term of the Virasoro 
algebra (4.3), and Bloch [8] found other zeta values appearing in other algebras of differ- 
ential operators, many of which have now been interpreted and generalised (starting with 
[77]) within the vertex operator algebra framework. 

Although a direct explanation for Monstrous Moonshine using affine algebras has never 
been found (and certainly isn't expected), the theory of Kac-Moody algebras influenced 
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every stage of the ultimate proof. For this reason we'll briefly sketch their theory. A 
simple finite-dimensional Lie algebra g is built out of the 3-dimensional algebra SI2, in a 
simple way; the Dynkin diagram of Q encodes the exact presentation. In the identical way, 
Kac-Moody algebras are also built out of copies of SI2 — the only difference is that the 
finite-dimensionality constraint (a positive-definiteness condition on the Cartan matrix) is 
lifted. Their structure is completely analogous to that of the simple Lie algebras: e.g. it 
has a grading by roots into finite-dimensional spaces; it has a triangular decomposition 
(making Verma modules possible); it has an invariant symmetric bilinear form. See e.g. 
[66] , [70] for details. The affine algebras are the class of Kac-Moody algebras especially 
analogous to the finite-dimensional ones. 

4.2. The Moonshine module VK A vital component of the Monstrous Moonshine 
conjectures came a few years after [24]. In a deep work, Frenkel-Lepowsky-Meurman 
[43], [44] constructed a graded infinite-dimensional representation of M and conjectured 
(correctly) that it is the representation V in (3.2a). V''^ has a very rich algebraic structure: 
it is in fact a vertex operator algebra! 

A vertex operator algebra [9], [44], [67], [39], [79] is an infinite-dimensional vector 
space V with infinitely many heavily constrained bilinear products u*nV. The name means 
'algebra of (generalised) vertex operators'; vertex operators are formal differential oper- 
ators which originally appeared in physics as quantum fields describing the creation and 
propagation of physical strings (see §6 below), and were constructed later but indepen- 
dently by Lie theorists (starting with Lepowsky and Wilson) to realise affine Kac-Moody 
algebras as algebras of differential operators. Because there were vertex operator construc- 
tions associated to lattices, affine algebra modules, and string theory, and all of these have 
connections to modular functions, it was natural to use vertex operators to try to construct 
the M-module V of (3.2a). 

The definition of vertex operator algebra (VOA) is too complicated to give in detail 
here. A VOA is a graded infinite-dimensional vector space V = ©J^o^n, where each Vn is 
finite-dimensional. To simplify the discussion, we will limit ourselves in this paper to VOAs 
with one-dimensional Vq, which is typical of the examples relevant to Moonshine (and 
conformal field theory). To each vector f G V we assign a vertex operator Y{v, z), which is 
a formal power series Y{v,z) — '^rne'z''^{m)^~^~^ : with coefficients V(^rn) ^ End(V). The 
vertex operator is just the generating function for the products: u*nV = W(„)(f). These 
products respect the grading — in particular, 

Vfc *n Vi C Vk+£-n-l ■ (4.1) 

A key axiom, which collects together all the identities obeyed by the products u*nV, can 
be written as 

{z-w)^[Y{u,z),Y{v,w)] = yu,veV, (4.2o) 

for some integer M (depending on u, v), where the bracket in (4.2a) means the commutator 
Y{u, z) Y{y, w) — Y{v, w) Y{u, z). This strange-looking formula really says that each such 
commutator is a linear combination of Dirac deltas and their derivatives, all centred at 2; = 
w (see e.g. Corollary 2.2 in [67]). Equation (4.2a) implies more down-to-earth identities. 
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such as 

{U£v)n = ^(-1)* i . J [Ui-i O Vn+i - (-l)^V^+n-i O Ui) . (4.26) 

There are two distinguished elements in V: the identity 1 G Vo (so Vo = CI) and 
the conformal vector u G V2. The identity obeys Y{l^z) = id, i.e. l(n)V = dn,-iv. More 
interesting is the conformal vector: writing L„ = u>(^n+i): the operators are required to 
form a representation on V of the Virasoro algebra: 

[Lm, Ln] = {m- n)Lm+n + ^m,-n ^2 ^^^^ ' ^^"^-^ 

for some number c G M (an important numerical invariant of V) called the rank or central 
charge of the VOA. In addition we require Lqu = nu whenever tt G Vn, and L_i acts on V 
as a derivation. 

The appearance here of the Virasoro algebra is fundamental. It is the unique nontrivial 
central extension of the (polynomial) vector fields on 5"^, which in turn is the Lie algebra 
associated to the group Diff(S'^) of diffeomorphisms of the circle. 

The notion of a VOA may seem very arbitrary, but as we'll mention in §6 it is the 
'chiral algebra' of a conformal field theory. The simplest (and least interesting) special case 
of a VOA occurs when M = in (4.2a) — i.e. when all vertex operators commute. Then 
it is not hard to see that, for each choice of z ^ 0, V would be a commutative associative 
algebra with unit, whose product is given hj u *z v :— Y{u, z)v. A more honest way to 
motivate VOAs has been suggested by Huang: binary trees can be used to keep track of the 
brackets in nested products, e.g. a{{hc)d), and e.g. Lie algebras can be easily formulated 
using this language [58] ; in a monumental work [59] , Huang 'two-dimensionalised' this Lie 
algebra formulation by replacing binary trees with spheres with tubes, and showed that 
the result is equivalent to a VOA. 

A relation between VOAs and Lie algebras also exists at a more elementary level. 
Placing £ = n = in (4.2b), and evaluating it on the right by G V, gives 

{uqv)ow = uq{vqw) - vo{uqw) . (4.4) 

Writing [xy] for xqJ/, this becomes the Lie algebra Jacobi identity, at least when [xy] — 
— [yx]. There are different ways to obtain from this a true Lie algebra. The simplest 
is that Vi will be a Lie algebra for that bracket; moreover, the number (m, v) defined by 
uiv = {u, v)l is an invariant bilinear form for this Lie algebra (by (4.2b) with i — 0,n = 1). 
Equation (4.4) tells us each Vn is a Vi-module, and in fact e" will be an automorphism of 
V for any w G Vi. In the most common examples, Vi will be reductive (i.e. a direct sum 
of simple and abelian Lie algebras). 

Modules M of V can be defined in the obvious way [42] , [79] — e.g. for each w G V, each 
tt(„) will be in End(M). All (irreducible) modules M come with a Z-grading M = (B'^^oMn, 
where UkM^ C Mn+e-k-i for all u G V^, and Lqx = {n + h)x for any x G M„, where h is 
some number (the conformal weight) depending only on M. The (normalised) character 
of M is 

oo 

xm(t) = Q-'^/^^TrMg^" = g^^-^/'" Yl ^MMn) g" • 

n=0 



13 



It takes a little effort to construct even the simplest examples of VOAs. The best- 
behaved ones are called rational VOAs [124] (borrowing on terminology from physics) and 
have only finitely many irreducible modules. A rational VOA is associated to any even 
positive-definite lattice L, and their modules are in one-to-one correspondence with the 
cosets L*/L. Another important example: to any affine nontwisted Kac-Moody algebra 
and choice of positive integer k (the 'level'), the highest weight module LkAo has a natural 
VOA structure, and its modules are precisely the affine algebra modules Lx for each highest 
weight X e P'^. 

One of the deepest results in the theory of VOAs is due to Zhu: 

Theorem 5. [124] Let V be a rational VOA. Its characters xm{t) are holomorphic 
in H, and the subspaces carry representations for Aut{V). Write Xv{t) for the vector 
whose components are the characters XMi^) of irreducible modules M. Then xv 
vector-valued modular function for SL2{'L). 

It is believed that the characters Xm{t) themselves will be modular functions for some 
r(A^); significant progress towards this was made in [5] (see also [71]). The proof of Zhu's 
Theorem is much more difficult than that of Theorem 4, which it generalises. 

The automorphism group Aut(V) is by definition required to fix which is why it 
respects the grading of V. Aut(V) is how group theory impinges on VOA theory. Since the 
automorphism group Aut(V) of a VOA contains e^^ as a (normal) subgroup, Aut(V) can 
be finite only when Vi = 0. Zhu's Theorem tells us that Moonshine (without the genus-0 
aspect) will hold between the group Aut(V) and the functions Xm{j)-, for any rational 
VOA. 

The most famous example of a VOA is the Moonshine module V^ of [44]. It is the 
orbifold of the Leech lattice VOA Va by the ±l-symmetry of A, which means it's the direct 
sum of two parts: an invariant part and a twisted part V^ (more on this in §5.1). The 
orbifold serves two purposes: it removes the constant term '24' from the graded dimension 
J -I- 24 (hence the subspace (Va)i) of Va; and it enhances the symmetry from the discrete 
part of Aut(VA), which is an extension of Coq by (6*2)^^, to all of M. 

A major claim of [44] was that is a 'natural' structure (hence their notation). Even 
so, this bipartite structure to V'^ complicates its study. We have Vq = CI, as usual, but 
the Lie algebra Vi = {0} is trivial. For any such VOA, the space V2 will be a commutative 
nonassociative algebra with product u x v :— uiv and identity ^ui. For the Moonshine 
VOA this can be shown (with effort!) to be the 196883-dimensional Griess algebra 
extended by an identity element. From this, we find the automorphism group of to 
be the Monster M. The only irreducible module for V^ is itself — such a VOA is called 
holomorphic. Together with Zhu's Theorem, this implies that its character, namely J(t), 
must be a modular function for SL2(Z) (strictly speaking, we only get invariance up to a 
1-dimensional character of SL2 (Z) , but it is easy to show that character must be identically 
1). We'll see in §5.1 how to obtain the other McKay-Thompson series from V^. 

Conjecturally, there are 71 holomorphic VOAs with rank c = 24 [106]. Much as the 
Leech lattice is the unique even self-dual positive-definite lattice of dimension 24 contain- 
ing no norm-2 vectors [26], the Moonshine module V^ is conjecturally [44] the unique 
holomorphic VOA with c = 24 and with trivial Vi. Thus, just as the Leech lattice is the 
unique lattice with theta series ©a, so (conjecturally) is the Moonshine module the unique 
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holomorphic VOA with (normalised) graded dimension J. Proving this is one of the most 
important (and difficult) challenges in the subject. 

4-3. The Monster Lie algebra m. To show that all of the McKay-Thompson series Tg 
are indeed Hauptmoduls, Borcherds needed identities satisfied by their g-expansions. He 
obtained these through a Lie algebra he associated to V^. Before discussing it, let's briefly 
describe Borcherds' generalisation of Kac-Moody algebras [10]. 

A Borcherds-Kac-Moody algebra differs from a Kac-Moody algebra in that it is 
built up from Heisenberg algebras as well as SI2, and these subalgebras intertwine in more 
complicated ways. Nevertheless much of the theory for finite-dimensional simple Lie al- 
gebras continues to find an analogue in this much more general setting (e.g. root-space 
decomposition, Weyl group, character formula,...). This unexpected fact is the point of 
Borcherds-Kac-Moody algebras. For reasons of space we avoid giving here the fairly simple 
definition, but for this and much more see the review articles [53] , [63] , [102] . 

Their basic structure theorem is that of Kac-Moody algebras. In particular, there is a 
grading by roots into finite-dimensional spaces (except that the 0-graded piece, correspond- 
ing to the Cartan subalgebra, may be infinite-dimensional). They also have a triangular- 
isable decomposition and an invariant symmetric bilinear form. Indeed, these structural 
properties characterise Borcherds-Kac-Moody algebras. In this sense Borcherds-Kac- 
Moody algebras are the ultimate generalisation of simple Lie algebras, in that any further 
generalisation would lose some basic structural ingredient. 

In short, Borcherds' algebras strongly resemble the Kac-Moody ones and constitute a 
natural and nontrivial generalization. The main differences are that they can be generated 
by copies of the 3-dimensional Heisenberg algebra as well as SI2, and that there can be 
imaginary simple roots. Borcherds introduced these algebras and developed their theory 
in order to understand the Monster Lie algebra m. 

We want to construct m from the Moonshine mo dulc = VS ®V} For later 

convenience, relabel its subspaces := V^_^i. Of course the obvious choice = V*^ is 
0-dimensional, so we must modify V'^ first. Let denote the even self-dual indefinite 
lattice consisting of all pairs (m, n) e I? with inner product (m, n) ■ (m', n') = mn' + nm' . 
Because it is indefinite, the usual construction of a VOA from a lattice will fail here to 
produce a true VOA, but most properties will be obtained. Call this near- VOA, Vi,i. 

The Monster Lie algebra m is a Lie algebra associated to the near- VOA V^^Vii 
— see [11] for the details, m inherits a //1,1-grading from Vi,i, and this is its root 
space decomposition: the (m, n) root space is isomorphic (as a vector space) to y"^", if 
(m, n) ^ (0,0); the (0,0) piece is isomorphic to IR^. Structurally, the Monster Lie algebra 
has a decomposition m = ti+ © gl2 © u~ into a sum of Lie subalgebras, where are free 
Lie algebras (see e.g. [64]). It inherits the action of M from V^. 

This construction of m may seem indirect; an alternate approach, anticipated in [11] 
and [12], uses Moonshine cohomology [81] — a functor, inspired by BRST cohomology in 
conformal field theory, assigning to certain c = 2 near-VOAs some Lie algebra carrying an 
action of M. To Vi,i this functor associates m. 

4- 4- Denominator identities and modular equations. It was discovered early on that 
the Hauptmoduls all obey the replication formulae, and that anything obeying those for- 
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mulae will be determined by their first few coefficients. The idea then is to show that 
the McKay-Thompson series Tg of (3.2c) also are replicable. Borcherds did this using Lie 
algebra denominator identities [11]. 

Finite-dimensional simple Lie algebras q possess a very useful formula for their char- 
acters, due to Weyl: the (formal) character x\ oi a module L\ equals 

X. := E = e"^ ^ ^ \ . (4.5) 

where W is the Weyl group, A_|_ the positive roots, e(w) = det(z«) is a sign, and where 
©^L;^(/x) is the weight-space decomposition of Lx- As the weights fi by definition lie in the 
dual f)* of the Cartan subalgebra of g, the character x\ can be regarded as a complex-valued 
function on the space f) = C (r = rank(£|)). 

Consider the trivial representation: i.e. x ^ for all x G g. Its character xo will 
be identically 1. Thus the character formula (4.5) tells us that a certain alternating sum 
over a Weyl group, equals a certain product over positive roots. These formulas, called 
denominator identities, are nontrivial even in this finite-dimensional case. 

In a famous paper [83], Macdonald generalised the denominator identity for (4.5), 
to infinite sum/product identities, corresponding to the extended Dynkin diagrams. The 
simplest one was known classically as the Jacobi triple product identity: 

oo oo 

J2 {-iTx'^'y'' ^Wil- x''^){l - x^'^-^y)(l - a;^™-^"') • (4.6) 

n= — oo m=l 

Macdonald's identities were later reinterpreted, by Kac and Moody, as denominator identi- 
ties for the affine algebras. For example, we now know (4.6) to be the denominator identity 
for the algebra A^^\ 

In particular, the same formula (4.5) holds for Kac-Moody algebras, except that the 
sum and product are now infinite, the positive roots now come with multiplicities, and 
the characters are usually normalised by a prefactor qf'*>^-c/24 rpj^^ variable r in Theorem 
4 is one of the coordinates in the Cartan subalgebra C^^^ of the affine algebra (see e.g. 
equation (13.2.4) of [66]). In that theorem we dropped the remaining variable dependence 
of the xa for readability, although those additional coordinates serve the important role of 
guaranteeing linear independence of the characters, and of giving us an action of SL2(Z) 
rather than merely PSL2(Z). 

Because a Borcherds-Kac-Moody algebra Q is triangularisable, highest weight q- 
modules can be defined in the usual way from Verma modules. The character formula 
becomes 



naeA^(l-e-«)^ 



where S\ is a correction factor due to imaginary simple roots. 

The corresponding denominator identity of the Monster Lie algebra m can be com- 
puted, and is given in (3.6b). Its Weyl group is C2 and sends the (m, n)-root space to 
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(n, m); the (m, n) root has multiphcity given by coefficient amn of J; for each n > we 
have an imaginary simple root {l,n) with multiphcity a^. Because of a cohomological in- 
terpretation of all denominator identities, (3.6b) can be 'twisted' by each (7 G M, and this 
gives (3.7b). These formulas are equivalent to the replication formula (3.7a) conjectured 
in §3.3. 

Identities equivalent to (3.7b) were obtained by more elementary means — i.e. meth- 
ods requiring less of the theory of Borchcrds-Kac-Moody algebras — in [64] and [68], 
permitting a simplification of Borcherds' proof at this stage. 

Now, it turns out that if we verify for each conjugacy class Kg of M that the first, 
second, third, fourth and sixth coefficients of the McKay-Thompson series Tg and the 
corresponding Hauptmodul Jq agree, then indeed Tg = Jq . That is precisely what 
Borcherds then did: he compared finitely many coefficients, and as they all equalled what 
they should, this concluded the proof [11] of Monstrous Moonshine! 

However, this case-by-case verification occurred at the critical point where the McKay- 
Thompson series were being compared directly to the Hauptmoduls, and so provides little 
insight into why the Tg are genus 0. Fortunately a more conceptual explanation of their 
equality has since been found. 

A function / obeying the replication formulae (3.7a) will also obey modular equations 
— i.e. a 2-variable polynomial identity satisfied by f{x) and f{nx). The simplest examples 
come from the exponential and cosine functions: note that for any n > 0, exp(na;) = 
(exp(x))"' and cos(nx) = T„(cos(a;)) where T„ is a Tchebychev polynomial. It was known 
classically that j (hence J) satisfied a modular equation for any n: e.g. put X = J(t) and 
Y = J(2t), then 

{X^ -Y){Y^ - X) ^ 393768 {X^ + Y^)+ 42987520 XY + 40491318744 {X + Y) 

- 120981708338256 . 

The only functions /(r) = + aiq + ■ ■ ■ which obey modular equations for all n, 
are J{t) and the 'modular fictions' q~^ and q~^ ± q (which are essentially exp, cos, and 
sin) [72]. More generally, we have: 

Theorem 6. [29] A function B{t) = ^nQ^ which obeys a modular equation 

for alln = 1 (mod N), will either be of the form B{t) — q~^+biq, or will be a Hauptmodul 
for a modular group of moonshine-type. 

The converse is also true [29]. The denominator identity argument tells us each Tg 
obeys a modular equation for each n = 1 modulo the order of g, so Theorem 6 then 
concludes the proof of Monstrous Moonshine. 

The computer searches in [20] suggest that the hypothesis of Theorem 6 may be 
considerably weakened, perhaps all the way down to the existence of modular equations 
for any two distinct primes. 

5. Further developments 

5.1. Orbifolds. About a third of the McKay-Thompson series Tg will have some neg- 
ative coefficients. In §5.4 we'll see Borcherds interpret them as dimensions of superspaces 



17 



(which come with signs). In an important announcement [97], on par with [24], Norton 

proposed that, although Tg( — 1/r) wiU not usually be another McKay-Thompson series, 
it will always have nonnegative integer g-coefhcients, and these can be interpreted as ordi- 
nary dimensions. In the process, he extended the g ^ Tg assignment to commuting pairs 
[g, /i) e M X M. 

In particular, to each such pair we have a function N{g,h;T), which we will call a 
Norton series, such that 

N{g'^h'',g'^h'';T) = aN{g,h;^±^) v(^^ e SL,{Z) , (5.1) 

for some root of unity a (of order dividing 24, and depending on g,h,a,b,c,d). The 
Norton series N{gj h; r) is either constant, or generates the modular functions for a genus-0 
subgroup of SL2(Z) containing some r(A^) (but otherwise not necessarily of moonshine- 
type). Constant N{g,h;T) arise when all elements of the form g'^h^ (for gcd(a, 6) = 1) 
are 'non-Fricke' (an element e M is called Fricke if the group Gg contains an element 
sending to icxo — the identity 1 is Fricke, as are 120 of the 171 Gg). Each N{g, h: r) has 
a g"^-expansion for that N; the coefficients of this expansion are characters evaluated at h 
of some central extension of the centralizer Cmig)- Simultaneous conjugation of g, h leaves 
the Norton series unchanged: N{aga~^,aha~^;T) = N{g, h; r). 

For example, when {g, h) = C2 x C2 and g, h, gh are all in class 2A, then N{g, h; r) = 
J(t) - 984. The McKay-Thompson series are recovered by the g = 1 specialisation: 
N{l,h;T) — Thij). This action (5.1) of SL2(Z) is related to its natural action on the 
fundamental group 7? of the torus, as we'll see in §6, as well as a natural action of the 
braid group, as we'll see next subsection. Norton arrived at his conjecture empirically, by 
studying the data of Queen (see §5.3). 

The basic tool we have for approaching Moonshine conjectures is the theory of VOAs, 
so we need to understand Norton's suggestion from that point of view. For reasons of 
space, we'll limit this discussion to V^, but it generalises. Given any automorphism g e 
Aviiiy^), we can define ^-twisted modules in the obvious way [36]. Then for each ^ e M, 
there is a unique gf-twisted module, call it V'^{g), for — this statement generalises 
the holomorphicity of mentioned in §4.2. More generally, given any automorphism 
h e Autiy^) commuting with g, h will yield an automorphism of V^{g), so we can perform 
Thompson's twist (3.2c) and write 

q-'^'^^'TYy.^g^hq'^' =: Z{g,h;T) . (5.2) 

These Z{g,hys can be thought of as the building blocks of the graded dimensions of 
various eigenspaces in V^{g): e.g. if h has order m, then the subspace of V''^{g) fixed by 
automorphism h will have graded dimension X^I^i -^{9^ ^*)- ^^e case of the Monster 
considered here, we have Z{g, h) = N{g, h). 

The important paper [36] proves that, whenever the subgroup {g, h) generated by g 
and h is cyclic, then N{g,h) will be a Hauptmodul satisfying (5.1). One way this will 
happen of course is whenever the orders of g and h are coprime. Extending [36] to all 
commuting pairs g, h is one of the most pressing tasks in Moonshine. 
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This orbifold construction is the same as was used to construct F'' from Va: F'^ is the 
sum of the 'i'-invariant subspace of Va with the 'i'-invariant subspace V'^ of the unique 
'— I'-twisted module for Va, where t G Aut(A) is some involution. The graded dimensions 
of are 2 ^(Z(±l, 1) + Z{±1, l)), respectively, and these sum to J. 

The orbifold construction is also involved in an interesting reformulation of the Haupt- 
modul property, due to Tuite [116]. Assume the uniqueness conjecture: is the only 
VOA with graded dimension J. He argues from this that, for each g G M, Tg will be a 
Hauptmodul iff the only orbifolds of are Va and itself. In e.g. [62], this analysis is 
extended to some of Norton's N{g, /i)'s, where the subgroup {g, h) is not cyclic (thus going 
beyond [36]), although again assuming the uniqueness conjecture. 

5.2. Why the Monster? That M is associated with modular functions can be explained 
by it being the automorphism group of the Moonshine VOA V^. But what is so special 
about this group M that these modular functions Tg and N{g, h) should be Hauptmoduls? 
This is still open. One approach is due to Norton, and was first (rather cryptically) stated 
in [97]: the Monster is probably the largest (in a sense) group with the 6-transposition 
property. Recall from §3.2 that a /c-transposition group G is one generated by a conjugacy 
class K of involutions, where the product gh of any two elements of K has order < k. For 
example, taking K to be the transpositions in the symmetric group G = Sn, we find that 
Sn is 3-transposition. 

A transitive action of F := PSL2(Z) on a finite set X with one distinguished point 
xq E X , is equivalent to specifying a finite index subgroup Fq of F. In particular, Fq is the 
stabiliser G F | g.xo = xq} of xq, X can be identified with the cosets Fo\F, and xq with 
the coset Fq. (If we avoid specifying xq, then Fq will be identified only up to conjugation.) 

To such an action, we can associate an interesting triangulation of the closed surface 
Fo\]H[, called a (modular) quilt. The definition, originally due to Norton and further devel- 
oped by Parker, Conway, and Hsu, is somewhat involved and will be avoided here (but see 
especially Chapter 3 of [57]). It is so-named because there is a polygonal 'patch' covering 
every cusp of Fo\]H[, and the closed surface is formed by sewing together the patches along 
their edges ('seams'). There are a total of 2n triangles and n seams in the triangulation, 
where n is the index ||Fo\F|| = ||-^||- The boundary of each patch has an even number of 
edges, namely the double of the corresponding cusp width. The familiar formula 

7= \- 1 

' 12 4 3 2 

for the genus 7 of To\E. in terms of the index n and the numbers rii of Fo-orbits of fixed- 
points of order i, can be interpreted in terms of the data of the quilt (see (6.2.3) of [57]), 
and we find in particular that if every patch of the quilt has at most 6 sides, then the 
genus will be or 1, and genus 1 only exceptionally. 

In particular, we're interested in one class of these F-actions (actually an SL2(Z)- 
action, but this doesn't matter). Recall that the braid group ^3 has presentation 

((Ti, (72 I cria2cri = 0-2(710-2) , (5.3a) 

and centre Z = (((Jicr2C7i)^) [7]. It is related to the modular group by 

B^/Z ^ PSL2(Z) , S3/(((7i(72(7i)^) ^ SL2(Z) . (5.36) 
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Fix a finite group G (we're most interested in the choice G = M). We can define a 
right action of ^3 on triples (5'i,^25^3) € G^ by 

{9i,92,gz)cri = (919291^,91,93) , i9i,92,93)(72 = (91,929392 ^,92) ■ (5.4a) 

We will be interested in this action on the subset of G^ where all Qi & G are involutions. 
The action (5.4a) is equivalent to a reduced version, where we replace (91,92,93) with 
(9192,9293) e G^. Then (5.4a) becomes 

(g,h)ai = (g,gh) , (g,h)a2 = (gh~^,h) . (5.46) 

These S3 actions come from specialisations of the Burau and reduced Burau representa- 
tions [7], respectively, and generalise to actions of on G'^ and We can get an 
action of SL2(Z) from the S3 action (5.4b) in two ways: either 

(i) by restricting to commuting pairs g, h; or 

(ii) by identifying each pair (g, h) with all its conjugates (aga~^,aha~^). 

Norton's SL2(Z) action of §5.1 arises from the S3 action (5.4b), when we perform both (i) 
and (ii). 

The quilt picture was designed for this SL2 (Z) action. The point of this construction 
is that the number of sides in each patch is determined by the orders of the corresponding 
elements g, h. If G is say a 6-transposition group (such as the Monster), and we take the 
involutions gi from 2A, then each patch will have < 6 sides, and the corresponding genus 
will be (usually) or 1 (exceptionally if at all). In this way we can relate the Monster with 
a genus-0 property. 

Based on the actions (5.4), Norton anticipates some analogue of Moonshine valid 
for noncommuting pairs. CFT considerations ('higher genus orbifolds') alluded to in §6 
suggest that more natural should be e.g. quadruples (g, g', h, h') e obeying ghg~^h~^ = 
h'g'h'-^g'-\ 

An interesting question is, how much does Monstrous Moonshine determine the Mon- 
ster? How much of M's structure can be deduced from e.g. McKay's Dynkin diagram 
observation, and/or the (complete) replicability of the T^,, and/or Norton's conjectures in 
§5.1, and/or Modular Moonshine in §5.4 below? A small start toward this is taken in [99], 
where some control on the subgroups of M isomorphic to Cp x Cp (p prime) was obtained, 
using only the properties of the series N(g, h). For related work, see Chapter 8 of [57]. 

5.3. Other finite groups. It is natural to ask about Moonshine for other groups. 
Indeed, the Hauptmodul for ro(2)-|- looks like 

q-^ + 4372? + 96256?^ + 12 40002g^ + • • • (5.5a) 

and we find the relations 

4372 = 4371 + 1 , 96256 = 96255 + 1 , 12 40002 = 11 39374 + 4371 + 2 • 1 , (5.56) 

where 1, 4371, 96255, and 11 39374 are all dimensions of irreducible representations of the 
Baby Monster B. Thus we find Moonshine for B! We will return to this example shortly. 
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Of course any subgroup of M automatically inherits Moonshine by restriction, but 
obviously this isn't interesting. Most constructions of the Leech lattice start with Mathieu's 
sporadic M24 (see e.g. Chapters 10 and 11 of [26]), and most constructions of the Monster 
involve the Leech lattice. Thus we are led to the following natural hierarchy of (most) 
sporadics: 

(1) M24 (from which we can get Mn, M12, M22, M23); which leads to 

(2) Coq = C2 X Coi (from which we get HJ, HS, McL, Suz, Co^, C02); which leads to 

(3) M (from which we get He, Fi22, i^«23, Fi'^^, HN, Th, B). 

It can thus be argued that we could approach problems in Monstrous Moonshine, by 
first addressing in order M24 and Coi, which should be much simpler. Indeed, the full 
VOA orbifold theory — i.e. the complete analogue of §5.1 — for M24 has been established 
in [38] (the relevant series Z{g, h) had already been constructed in [88]). 

Largely by trial and error. Queen [101] established Moonshine for the following groups 
(all essentially centralisers of elements of M): Coq, Th, ?>.2.Suz, 2.HJ, HN, 2.A7, He, M12 
(by e.g. '2.i7J' we mean C2 is normal and HJ is the quotient 2.i7 J/C2). In particular, to 
each element g of these groups, there corresponds a series Qgir) — + '^n{g)(r, 
which is a Hauptmodul for some modular group of moonshine-type, and where each g 1— > 
an{g) is a virtual character. For Th, HN, He and M12 it is a proper character. Other 
differences with Monstrous Moonshine are that there can be a preferred nonzero value for 
the constant term oq, and that although ro(A^) will be a subgroup of the fixing group, it 
won't necessarily be normal. We will return to these results next section, where we will see 
that many seem to come out of the Moonshine for M. About half of Queen's Hauptmoduls 
Qg for Coq do not cinSG clS cl McKay-Thompson series for M. Norton's conjectures in §5.1 
are a reinterpretation and extension of Queen's work. 

Queen never reached B because of its size. However, the Moonshine (5.5) for B falls 
into her and Norton's scheme because (5.5a) is the McKay-Thompson series associated to 
class 2A of M, and the centraliser of an element in 2A is a double cover of B. 

There can't be a VOA V = ©nKi with graded dimension (5.5a) and automorphisms 
in B, because e.g. the B-module V3 doesn't contain V2 as a submodule. However, Hohn 
deepened the analogy between M and B by constructing a vertex operator superalgebra 
VM^ of rank c = 23.5, called the shorter Moonshine module, closely related to (see e.g. 
[56]). Its automorphism group is C2 x B. Just as M is the automorphism group of the 
Griess algebra V2, so is B the automorphism group of the algebra {VM^)2- Just as is 
associated to the Leech lattice A, so is VM^ associated to the shorter Leech lattice O23, 
the unique 23-dimensional positive-definite self-dual lattice with no vectors of length 2 or 
1 (see e.g. Chapter 6 of [26]). The automorphism group of O23 is C2 x C02. 

There has been no interesting Moonshine rumoured for the remaining six sporadics 
(the pariahs Ji, J3, Ru, ON, Ly, J4). There will be some sort of Moonshine for any group 
which is an automorphism group of a vertex operator algebra (so this means any finite 
group [37]!). Many finite groups of Lie type should arise as automorphism groups of VOAs 
associated to affine algebras except defined over finite fields. But apparently all known 
examples of genus-0 Moonshine are limited to the groups involved with M. 

5.4- Modular Moonshine. Consider an element g e M. We expect from [101], [97], 
[36] that there is a Moonshine for the centraliser Cm(^) of g in M, governed by the g- 
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twisted module V''^{g). Unfortunately, V''^{g) is not usually itself a VOA, so the analogy 
with M is not perfect. Ryba found it interesting that, for (7 G M of prime order p, Norton's 
series N{g, h) is a McKay-Thompson series (and has all the associated nice properties) 
whenever h is p-regular (i.e. h has order coprime to p). This special behaviour of regular 
elements suggested to him to look at modular representations. 

The basics of modular representations and Brauer characters are discussed in sufficient 
detail in Chapter 2 of [31]. A modular representation p of a group G is a representation 
defined over a field of positive characteristic p dividing the order \G\ of G. Such represen- 
tations possess many special (i.e. unpleasant) features. For one thing, they are no longer 
completely reducible (so the role of irreducible modules as direct summands will be re- 
placed with their role as composition factors). For another, the usual notion of character 
(the trace of representation matrices) loses its usefulness and is replaced by the more subtle 
Brauer character f3{p): a complex- valued class function on M which is only well-defined 
on the p-regular elements of G. 

Theorem 7. [105], [17], [13] Let g eM be any element of prime order p, for any p 
dividing |M|. Then there is a vertex operator superalgebra = ®nei^^n defined over the 
finite field Fp and acted on by the centraliser Cmig)- If h E Cyiig) is p-regular, then the 
graded Brauer character 

R{g,h;r) :^ q-' ^ (3{^Vn)ih) q"^ 

equals the McKay-Thompson series Tghij). Moreover, for g belonging to any conjugacy 
class in M except 2B, SB, 5B, 7B, or 13B, this is in fact an ordinary VOA (i.e. the 'odd' 
part vanishes), while in the remaining cases the graded Brauer characters of both the odd 
and even parts can separately be expressed using McKay-Thompson series. 

By a vertex operator superalgebra, we mean there is a Z2-grading into even and odd 
subspaces, and for tt, v both odd the commutator in (4.2a) is replaced by an anticommu- 
tator. In the proof, the superspaces arise as cohomology groups, which naturally form an 
alternating sum. The centralisers Cmig) in the Theorem are quite nice: e.g. for g in classes 
2A, 2B, 3A, 3B, 3C, 5A, 5B, 7A, llA, respectively, these involve the sporadic groups B, 
Coi, Fi'24^, Suz, Th, HN, HJ, He, and M12. The proof for p = 2 is not complete at the 
present time. The conjectures in [105] concerning modular analogues of the Griess algebra 
for several sporadics follow from Theorem 7. 

Can these modular ^Vs be interpreted as a reduction mod p of (super) algebras in 
characteristic 0? Also, what about elements g of composite order? 

Conjecture 8. [13] Choose any g & M. and let n denote its order. Then there is 
a ^X-graded superspace = ®iaLz^Vi over the ring of cyclotomic integers Zfe^"^'/"]. It 

is often (but probably not always) a vertex operator superalgebra — in particular is an 
integral form of the Moonshine module V^. Each carries a representation of a central 
extension of Cmig) by On- Define the graded trace 

Big,h;T) = q-' ^^^vS^)^' ■ 
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Ifg, h commute and have coprime orders, then B{g, h; r) = Tgh{r). If all q- coefficients 
of Tg are nonnegative, then the 'odd' part of vanishes, and is the g-twisted module 
V\g) of [36]. If g has prime order p, then the reduction mod p of^V is the modular vertex 
operator superalgehra of Theorem 7. 

When we say is an integral form for V'', we mean that has the same structure 
as a VOA, with everything defined over Z, and tensoring it with C recovers V^'^. This 
remarkable conjecture, which tries to explain Theorem 7, is completely open. 

5.5. The geometry of Moonshine. Algebra is the mathematics of structure, and so of 
course it has a profound relationship with every area of mathematics. Therefore the trick 
for finding possible fingerprints of Moonshine in say geometry is to look there for modular 
functions. And that search quickly leads to the elliptic genus. 

For details see e.g. [55], [108], [112]. All manifolds here are compact, oriented and 
differentiable. In Thom's cobordism ring fi, elements are equivalence classes of cobordant 
manifolds, addition is connected sum, and multiplication is Cartesian product. The uni- 
versal elliptic genus 0(M) is a ring homomorphism from Q ® Q to the ring of power series 
in q, which sends n-dimensional manifolds with spin connections to a weight n/2 modular 
form of To (2) with integer coefficients. Several variations and generalisations have been 
introduced, e.g. the Witten genus assigns spin manifolds with vanishing first Pontryagin 
class a weight n/2 modular form of SL2(Z) with integer coefficients. 

Several deep relationships between elliptic genera and the general material reviewed 
elsewhere in this paper, have been uncovered. For instance, the important rigidity property 
of the Witten genus with respect to any compact Lie group action on the manifold, is a 
consequence of the modularity of the characters of affine algebras (our Theorem 4) [81]. 
The elliptic genus of a manifold M has been interpreted as the graded dimension of a 
vertex operator superalgebra constructed from M [111]. Seemingly related to this, [18] 
recovered the elliptic genus of a Calabi-Yau manifold X from the sheaf of vertex algebras 
in the chiral de Rham complex [85] attached to X. Unexpectedly, the elliptic genus of 
even-dimcnsional projective spaces P^"^ has nonnegative coefficients and in fact equals the 
graded dimension of a certain vertex algebra [86] ; this suggests interesting representation- 
theoretic questions in the spirit of Monstrous Moonshine. In physics, elliptic genera arise 
as partition functions of AT = 2 superconformal field theories [120]. Mason's constructions 
[88] associated to Moonshine for the Mathieu group M24 have been interpreted as providing 
a geometric model ('elliptic system') for elliptic cohomology Ell*(i?M24) of the classifying 
space of M24 [112], [39]. The Witten genus (normalised by 77^) of the Milnor-Kervaire 
manifold Mq , an 8-dimensional manifold built from the diagram, equals [55] (recall 
(3.5)). 

Hirzebruch's 'prize question' (p. 86 of [55]) asks for the construction of a 24-dimensional 
manifold M with Witten genus J (after being normalised by rj'^^). We would like M to act 
on M by diffeomorphisms, and the twisted Witten genera to be the McKay-Thompson 
series Tg. It would also be nice to associate Norton's series N{g,h) to this Moonshine 
manifold. Constructing such a manifold is perhaps the remaining Holy Grail of Monstrous 
Moonshine. 

Hirzebruch's question was partially answered by Mahowald and Hopkins [84], who 
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constructed a manifold with Witten genus J, but couldn't show it would support an 

effective action of M. Related work is [3], who constructed several actions of M on e.g. 
24-dimensional manifolds (but none of which could have genus J), and [73], who showed 
the graded dimensions of the subspaces of the Moonshine module are twisted A-genera 
of Milnor-Kervaire's manifold Mq (the A-genus is the specialisation of elliptic genus to 
the cusp ioo). 

There has been a second conjectured relationship between geometry and Monstrous 
Moonshine. Mirror symmetry says that most Calabi-Yau manifolds come in closely related 
pairs. Consider a 1-parameter family Xz of Calabi-Yau manifolds, with mirror X* given 
by the resolution of an orbifold X/G for G finite and abelian. Then the Hodge numbers 
h^'^{X) and h'^'^{X*) will be equal, and more precisely the moduli space of (complexified) 
Kahler structures on X will be locally isometric to the moduli space of complex structures 
on X*. The 'mirror map' z{q), which can be defined using the Picard-Fuchs equation [95], 
gives a canonical map between those moduli spaces. For example, xf + X2+xf + xf + 
z~^/'^xiX2X3X4 = is such a family of K3 surfaces, where G = C4 x C4. Its mirror map is 
given by 

z(q) = q- 104?^ + 6444?^ - 311744?^ + ISOISSSO?^ - 493025760?^ + • • • . (5.6) 

Lian-Yau [80] noticed that the reciprocal l/z{q) of the mirror map in (5.6) equals the 
McKay-Thompson series Tg{T) + 104 for g in class 2A of M. After looking at several other 
examples with similar conclusions, they proposed their Mirror- Moonshine Conjecture: The 
reciprocal I/2; of the mirror map of a 1-parameter family of K3 surfaces with an orbifold 
mirror, will be a McKay-Thompson series (up to an additive constant). 

A counterexample (and more examples) are given in §7 of [118]. In particular, al- 
though there are relations between mirror symmetry and modular functions (see e.g. [51] 
and [54]), there doesn't seem to be any special relation with the Monster. Doran [40] 'de- 
mystifies the Mirror-Moonshine phenomenon' by finding necessary and sufficient conditions 
for 1/z to be a modular function for a modular group commensurable with SL2(Z). 

6. The physics of Moonshine 

The physical side (perturbative string theory, or equivalently conformal field theory) 
of Moonshine was noticed early on, and has profoundly influenced the development of 
Moonshine and VOAs. This is a very rich subject, which we can only superficially touch 
on. The book [32], with its extensive bibliography, provides an introduction but will be 
difficult reading for many mathematicians (as will this section!) The treatment in [45] is 
more accessible and shows how naturally VOAs arise from the physics. This effectiveness of 
physical interpretations isn't magic — it merely tells us that many of our finite-dimensional 
objects are seen much more clearly when studied through infinite-dimensional structures 
(often by being 'looped'). Of course Moonshine, which teaches us to study the finite group 
M via its infinite-dimensional module V\ fits perfectly into this picture. 

A conformal field theory (CFT) is a quantum field theory on 2-dimensional space-time, 
whose symmetries include the conformal transformations. In string theory the basic objects 
are finite curves ('strings') rather than points ('particles'), and the CFT lives on the surface 



24 



traced by the strings as they evolve (coUiding and separating) through time. Each CFT is 
associated with a pair Vl, Vr of mutually commuting VOAs, called its chiral algebras [6]. 
For example, strings living on a compact Lie group manifold (the so-called Wess-Zumino- 
Witten model) will have chiral algebras given by affine algebra VOAs. The space Ti of 
states for the CFT carries a representation of Vl ® Vri and many authors have (somewhat 
optimisticly) concluded that the study of CFTs reduces to that of VOA representation 
theory. Rational VOAs correspond to the important class of rational CFTs, where 
decomposes into a finite sum ©Mj, ® of irreducible modules. The Virasoro algebra 
(4.3) arises naturally in CFT through infinitesimal conformal transformations. The vertex 
operator Y{(j), z), for the space-time parameter z = 6*+'^, is the quantum field which creates 
from the vacuum |0) G Ti the state |(^) G 7i at time t = — oo: |0) = lim^_^o^(f/'! z) |0). In 
particular, Borchcrds' definition [9] of VOAs can be interpreted as an axiomatisation of 
the notion of chiral algebra in CFT, and for this reason alone is important. 

In CFT, the Hauptmodul property of Moonshine is hard to interpret, and a less 
direct formulation like that in [116] is needed. However, both the statement and proof of 
Theorem 5 are natural from the CFT framework (see [45]) — e.g. the modularity of the 
series Tg and N{g, h) are automatic in CFT. This modularity arises in CFT through the 
equivalence of the Hamiltonian formulation, which describes concretely the graded spaces 
we take traces on (and hence the coefficients of our g-expansions) , and the Feynman path 
formalism, which interprets these graded traces as sections over moduli spaces (and hence 
makes modularity manifest). Beautiful reviews are sketched in [119], [120]. 

Because V''^ is so mathematically special, it may be expected that it corresponds to 
interesting physics. Certainly it has been the subject of some speculation. There will be 
a c = 24 rational CFT whose chiral algebra Vl and state space 7i are both V^, while 
Vr is trivial (this is possible because V'^ is holomorphic). This CFT is nicely described 
in [34]; see also [35]. The Monster is the symmetry of that CFT, but the Bimonster 
M I C2 will be the symmetry of a rational CFT with n = V^®V^. The paper [27] finds 
a family of D-branes for the latter theory which are in one-to-one correspondence with 
the elements of M, and their 'overlaps' {{g\\q'2^^o+^''~^^ \\h)) equal the McKay-Thompson 
series Tg-if^. However, we still lack any explanation as to why a CFT involving V'^ should 
yield interesting physics. 

Almost every facet of Moonshine finds a natural formulation in CFT, where it often 
was discovered first. For example, the 'No-Ghost' Theorem of Brower-Goddard-Thorn 
was used to great effect in [11] to understand the structure of the Monster Lie algebra m. 
On a finite-dimensional manifold M, the index of the Dirac operator D in the heat kernel 
interpretation is a path integral in supersymmetric quantum mechanics, i.e an integral over 
the free loop space CM = {7 : S'^ — > M}; the string theory version of this is that the index 
of the Dirac operator on CM should be an integral over C{CM), i.e. over smooth maps 
of tori into M, and this is just the elliptic genus, and explains why it should be modular. 
The orbifold construction of [36] comes straight from CFT (although [43] 's construction of 
predates CFT orbifolds by a year and in fact infiuenced their development in physics). 
That said, the translation process from physics to mathematics of course is never easy — 
Borcherds' definition [9] is a prime example! 

But from this standpoint, what is most exciting is what hasn't yet been fully exploited. 
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String theory tells us that CFT can live on any surface E. The VOAs, including the 

geometric VOAs of [59], capture CFT in genus 0. The graded dimensions and traces 
considered above concern CFT quantities ('conformal blocks') at genus 1: r i-^ e^^'^ maps 
EI onto a cylinder, and the trace identifies the two ends. But there are analogues of all this 
at higher genus [123] (though the formulas can rapidly become awkward). For example, 
the graded dimension of e.g. the CFT in genus 2 is computed in [117], and involves 
e.g. Siegel theta functions. The orbifold theory in §5.1 is genus 1: each 'sector' {g,h) 
corresponds to a homomorphism from the fundamental group 7? of the torus into the 
orbifold group G (e.g G — M) — g and h are the targets of the two generators of and 
hence must commute. More generally, the sectors will correspond to each homomorphism 
(fi : 7ri(E) G, and to each we will get a higher genus trace 2{(p), which will be a function 
on the Teichmiiller space Tg (generalising the upper half-plane EI for genus 1). The action 
of SL2(Z) on the N{g, h) generalises to the action of the mapping class group on 7ri(S) 
and Tg. See e.g. [4] for some thoughts in this direction. 

7. Conclusion 

There are different basic aspects to Monstrous Moonshine: (i) why modularity enters 
at all; (ii) why in particular we have genus 0; and (iii) what does it have to do with the 
Monster. We understand (i) best. There will be a Moonshine-like relation between any 
(subgroup of the) automorphism group of any rational VGA, and the characters xm-, and 
the same can be expected to hold of the orbifold characters Z in §5.1. 

To prove the genus property of the T^, we needed recursions obtained one way or 
another from the Monster Lie algebra m, and from these we apply Theorem 6. These 
recursions are very special, but so presumably is the genus property. The suggestion of 
[20] though is that we may be able to considerably simplify this part of the argument. 

Every group known to have rich Moonshine properties is contained in the Monster. 
Our understanding of this seemingly central role of M is the poorest of those three aspects. 

It should be clear from this review, of the central role VOAs play in our current under- 
standing of Moonshine. The excellent review [39] makes this point even more forcefully. It 
can be (and has been) questioned though whether the full and difficult machinery of VOAs 
is really needed to understand this, i.e. whether we really have isolated the key conjunction 
of properties needed for Moonshine to arise. CFT has been an invaluable guide thus far, 
but perhaps we are a little too steeped in its lore. 

Moonshine (in its more general sense) is a relation between algebra and number the- 
ory, and its impact on algebra has been dramatic (e.g. VOAs, V^^ Borcherds-Kac-Moody 
algebras). Its impact on number theory has been far less so. This may merely be a tem- 
porary accident due to the backgrounds of most researchers (including the mathematical 
physicists) working to date in the area. But the most exciting prospects for the future of 
Moonshine (in this writer's opinion) are in the direction of number theory. Hints of this 
future can be found in e.g. [121], [41], [14], [33], [52], [94]. 
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